Picture for Qian Liu

Qian Liu

SynSense AG, Swizerland

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Add code
Feb 05, 2026
Viaarxiv icon

Proxy Compression for Language Modeling

Add code
Feb 04, 2026
Viaarxiv icon

Beyond Precision: Training-Inference Mismatch is an Optimization Problem and Simple LR Scheduling Fixes It

Add code
Feb 02, 2026
Viaarxiv icon

Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning

Add code
Dec 28, 2025
Viaarxiv icon

LLM-based Behaviour Driven Development for Hardware Design

Add code
Dec 23, 2025
Viaarxiv icon

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Add code
Dec 14, 2025
Viaarxiv icon

Bridging Minds and Machines: Toward an Integration of AI and Cognitive Science

Add code
Aug 28, 2025
Viaarxiv icon

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Add code
Aug 24, 2025
Figure 1 for TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Figure 2 for TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Figure 3 for TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Figure 4 for TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Viaarxiv icon

Graph Learning for Cooperative Cell-Free ISAC Systems: From Optimization to Estimation

Add code
Jul 09, 2025
Viaarxiv icon

First Return, Entropy-Eliciting Explore

Add code
Jul 09, 2025
Viaarxiv icon